NASTyLinker: NIL-Aware Scalable Transformer-Based Entity Linker

Abstract

Entity Linking (EL) is the task of detecting mentions of entities in text and disambiguating them to a reference knowledge base. Most prevalent EL approaches assume that the reference knowledge base is complete. In practice, however, it is necessary to deal with the case of linking to an entity that is not contained in the knowledge base (NIL entity). Recent works have shown that, instead of focusing only on affinities between mentions and entities, considering inter-mention affinities can be used to represent NIL entities by producing clusters of mentions. At the same time, inter-mention affinities can help to substantially improve linking performance for known entities. With NASTyLinker, we introduce an EL approach that is aware of NIL entities and produces corresponding mention clusters while maintaining high linking performance for known entities. The approach clusters mentions and entities based on dense representations from Transformers and resolves conflicts (if more than one entity is assigned to a cluster) by computing transitive mention-entity affinities. We show the effectiveness and scalability of NASTyLinker on NILK, a dataset that is explicitly constructed to evaluate EL with respect to NIL entities. Further, we apply the presented approach to an actual EL task, namely to knowledge graph population by linking entities in Wikipedia listings, and provide an analysis of the outcome.
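
The conflict-resolution idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function name `link_mentions`, the dictionary-based inputs, the single `threshold` parameter, and the choice to define a transitive affinity as the product of edge affinities along the best path are all assumptions made for illustration. In the paper, the affinity scores would come from Transformer-based models; here they are given as plain numbers.

```python
import heapq
from collections import defaultdict


def link_mentions(mention_mention, mention_entity, threshold=0.5):
    """Assign each mention to a known entity or to a NIL cluster.

    mention_mention: dict mapping (mention, mention) -> affinity in [0, 1]
    mention_entity:  dict mapping (mention, entity)  -> affinity in [0, 1]
    Both formats and the threshold are hypothetical, for illustration only.
    """
    # Undirected mention graph; edges below the threshold are dropped.
    neighbours = defaultdict(dict)
    mentions = set()
    for (m1, m2), aff in mention_mention.items():
        mentions.update((m1, m2))
        if aff >= threshold:
            neighbours[m1][m2] = aff
            neighbours[m2][m1] = aff

    # Seed a max-heap (negated scores) with direct mention-entity edges.
    heap = []
    for (m, e), aff in mention_entity.items():
        mentions.add(m)
        if aff >= threshold:
            heapq.heappush(heap, (-aff, m, e))

    # Multi-source best-first search: a mention keeps the entity whose
    # best path (largest product of affinities) reaches it first. This
    # resolves clusters that would otherwise contain several entities.
    assigned = {}
    while heap:
        neg_aff, m, e = heapq.heappop(heap)
        if m in assigned:
            continue
        assigned[m] = e
        for other, aff in neighbours[m].items():
            if other not in assigned:
                heapq.heappush(heap, (neg_aff * aff, other, e))

    # Mentions never reached from any entity form NIL clusters
    # (connected components of the remaining mention graph).
    result = dict(assigned)
    nil_count = 0
    for m in sorted(mentions):
        if m in result:
            continue
        label = f"NIL-{nil_count}"
        nil_count += 1
        stack = [m]
        while stack:
            cur = stack.pop()
            if cur in result:
                continue
            result[cur] = label
            stack.extend(n for n in neighbours[cur] if n not in result)
    return result


# Toy example: m2 is pulled towards E1 via m1 and towards E2 via m3.
links = link_mentions(
    {("m1", "m2"): 0.9, ("m2", "m3"): 0.8, ("m3", "m4"): 0.2, ("m4", "m5"): 0.7},
    {("m1", "E1"): 0.9, ("m3", "E2"): 0.95, ("m2", "E2"): 0.3},
)
print(links)
```

In this toy input, the transitive affinity of m2 to E1 (0.9 × 0.9) is higher than to E2 (0.95 × 0.8), so m2 ends up with E1, while m4 and m5 have no sufficiently strong path to any entity and share one NIL cluster.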

Related resources

Time-Aware Entity Linking

Entity Linking is the task of automatically identifying entity mentions in a piece of text and linking them to their corresponding entries in a reference knowledge base like Wikipedia. Although there is a plethora of works on entity linking, existing state-of-the-art approaches do not explicitly consider the time aspect and specifically the temporality of an entity’s prior probability (populari...

Scalable Link-based Personalization for Ranking in Entity-Relationship Graphs

Authority flow techniques like PageRank and ObjectRank can provide personalized ranking of typed entity-relationship graphs. There are two main ways to personalize authority flow ranking: node-based personalization, where authority originates from a set of user-specific nodes; and edge-based personalization, where the importance of different edge types is user-specific. We propose for the first time ...

Scalable Probabilistic Entity-Topic Modeling

We present an LDA approach to entity disambiguation. Each topic is associated with a Wikipedia article and topics generate either content words or entity mentions. Training such models is challenging because of the topic and vocabulary size, both in the millions. We tackle these problems using a novel distributed inference and representation framework based on a parallel Gibbs sampler guided by...

A Pruning Based Approach for Scalable Entity Coreference

Entity coreference is the process of deciding which identifiers (e.g., person names, locations, ontology instances) refer to the same real-world entity. In the Semantic Web, entity coreference can be used to detect equivalence relationships between heterogeneous Semantic Web datasets and to explicitly link coreferent ontology instances via the owl:sameAs property. Due to the large scale of Sema...

Towards Scalable Real-Time Entity Resolution using a Similarity-Aware Inverted Index Approach

Most research into entity resolution (also known as record linkage or data matching) has concentrated on the quality of the matching results. In this paper, we focus on matching time and scalability, with the aim to achieve large-scale real-time entity resolution. Traditional entity resolution techniques have assumed the matching of two static databases. In our networked and online world, howev...

Journal

Journal title: Lecture Notes in Computer Science

Year: 2023

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-031-33455-9_11